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ABSTRACT 


The fundamental control synthesis issue of establishing a priori convergence rates 
of approximation schemes for feedback controllers for a class of distributed parameter 
systems is addressed within the context of hereditary systems. Specifically, a 
factorization approach is presented for deriving approximations to the optimal feedback 
gains for the linear regulator- quadratic cost problem associated with time-varying 
functional differential equations with control delays. The approach is based on a 
discretization of the state penalty which leads to a simple structure for the feedback 
control law. General properties of the Volterra factors of Hilbert- Schmidt operators 
are then used to obtain convergence results for the controls, trajectories and feedback 
kernels. Two algorithms are derived from the basic approximation scheme, including a 
fast algorithm, in the time-invariant case. A numerical example is also considered. 
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1. INTRODUCTION . This report represents a first application of the control synthesis 
technique based on factorization methods developed in [24]. The main significance of 
the results contained herein stems from our ability to obtain a detailed analysis of a 
priori convergence rates for numerical approximation of feedback controllers for a 
class of distributed parameter systems. Although in the past 15 years much work has 
been done in developing approximation schemes for various distributed controllers, the 
important issue of convergence rates has been largely neglected. This is due primarily 
to the fact that it is a difficult problem, and control researchers have been content 
with the first order assessment regarding whether or not their approximation scheme 
converges at all (in certain correct senses). In this report convergence rates are 
established in the context of a challenging class of problems that include control delay 
terms. The underlying methods are of particular importance since they more tightly 
connect the modeling and control synthesis problems, and hence, have application to 
other distributed parameter systems. 

The particular problem we address is the computation of the optimal feedback 
gain for the finite time linear regulator quadratic cost problem for systems governed by 
retarded functional differential equations (RFDE) with control delays. Feedback 
control laws for these systems have been previously derived for both the finite and 
infinite time problems in several articles under various hypotheses (see for example [6], 
[13], [14], [17], [24], [25]). A common approach in several of these articles is to relate 
the RFDE with control delays to an evolution equation in an appropriate state space. 
The feedback control law then arises in the familiar form as a solution to an operator 
Riccati equation. However, when point delays appear in the control, an unbounded 
input operator results and this operator then appears in the quadratic term of the 
Riccati equation. The presence of this unbounded term would appear to complicate any 
analysis of approximation schemes for the feedback gains. 

The approach in [24] utilizing factorization theory [11], [22], allows us to 
circumvent these difficulties. The interest here is to further pursue the approach of 
[24] to the problem of approximating the optimal feedback kernel and deriving 
convergence estimates for the approximations, and the resulting controls and 
trajectories. (We note some other applications of factorization include for example, 
filtering and smoothing of nonstationary processes over a finite interval [15], [16], 
inverse problems in the spectral theory of differential operators [8], [18], solutions to 
two point boundary value problems [21], and solutions to Fredholm equations of the first 
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and second kinds, 111].) Although this specific problem has not to this author's 
knowledge been treated in the literature, several articles (e.g., [6], (10], [19J) have 
considered the approximation problem without control delays. 

The approach in each of these articles involves expressing the RFDE as an 
evolution equation in the state space R N xL 2> and then approximating the resulting 
dynamical system. Delfour (5] discretizes in both the spatial and time variables, while 
Kunisch [19] and Gibson [10] discretize in only the spatial variable. Delfour considers 
the time-varying problem and obtains weak convergence of the solutions to the 
approximating Riccati equations. In [10] and [19] the open- loop semigroup is first 
approximated by discretizing the history space, and then the approximation theory of 
[9] is used for subsequent convergence analysis. This analysis is based on exploiting the 
relationship developed in [9] between the open-loop semigroup and the Riccati equation 
defining the feedback law. Open loop semigroup approximations have been derived in 
[2], [3], By careful considerations involving the adjoint semigroup together with 
properties deduced from the finite dimensionality of the control space, Gibson was able 
to demonstrate strong convergence of the approximating Riccati operators. When 
coupled with the finite dimensionality of the input space, this leads to uniform 
convergence of the feedback operators. This is a significant result with respect to 
control implementation, since with uniform convergence the optimality of the 
approximating feedback law is independent of the state. 

The convergence analysis presented in [10] and [19] depends heavily on the fact 
that the control map has finite rank. This condition does not hold in the control delay 
problem and straightforward extensions of these convergence results to the control 
delay case are not apparent. However, the form of the feedback kernel derived in [24] 
remains amenable to approximation and convergence analysis regardless of the presence 
of control delays. The reason for this is that control delays do not present any 
complications in the open loop formulation of the optimal control law and that the 
relationship between open loop and closed loop is somewhat transparent in the approach 
of [24] - the feedback kernel is derived from linear operators involving the fundamental 
matrix of the RFDE and the solution to a certain factorization problem associated with 
the fundamental matrix. Convergence questions for approximations to the feedback 
gain then reduce to corresponding questions regarding convergence of solutions to 
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related factorization problems. (We note that the integral Siccati equations of [9] are 
also equivalent to certain factorization problems [23], so that the convergence analysis 
in {10] and [1.9] can also be performed within a factorization context.) 

It will be shown that if the cost on the state in the regulator problem is a discrete 
sum with no integral term, then the associated factorization problem is solved by 
matrix inversion, and the exact feedback kernel can be defined in terms of the 
fundamental matrix solution, quadrature, and the solutions to finite dimensional linear 
equations. (This form of the solution generalizes a result of Manitius [20] for the 
problem with terminal state penalty and no control delays.) Thus an approximation 
scheme for the problem containing an integral state penalty term can be developed by 
approximating this term by quadrature and solving exactly for the feedback kernel of 
the resulting discretized state cost problem. Using this approach together with 
factorization arguments we will be able to establish 0(l/n) L - convergence for the 

OO 

approximate feedback kernels in the time-varying control delay case. 

This result is (analytically) somewhat sharper than Gibson’s in that the 
convergence of the kernels applies on the square as well as the diagonal, and also that 
a priori rates can be provided. The principal reason this sharper result can be obtained 
is that we exploit the fact that only a computable piece of the semigroup - that part 
contributed by the fundamental matrix solution - is required to define the feedback 
kernel. Thus it is never necessary to consider the more difficult problem of 
approximating the entire semigroup. We now briefly outline the organization of the 
report. 

In Section 2 the necessary mathematical preliminaries are developed and 
discussed. Because the approach does not follow along a Siccati synthesis of the 
solution, we will recapitulate in this section portions of the discussion in [24] relevant 
to the present application - particularly certain aspects of the Volterra factorization 
and how they apply to the SFDE control problem. 

Section 3 contains the L - convergence results for the feedback kernels, controls, 
and trajectories. Instead of considering specific quadrature schemes approximating the 
state cost, all the results are proved with respect to a sequence of Borel measures 
satisfying certain convergence hypotheses. The key tool of this section is a 


3 


factorisation lemma which asserts that the factorisation problem is well-posed (in an 
appropriate sense) in the space of integral operators with essentially bounded kernels. 

In Section 4, the explicit form of the optimal feedback kernel associated with 
discrete state cost is derived. The resulting approximation scheme developed from the 
cost discretizations is then used as an analytical tool to obtain further results regarding 
the feedback kernel. For example, using essentially matrix manipulations, a 
Wiener-Hopf integral equation for the optimal feedback kernel is derived which is 
shown to be the control delay generalization of the Wiener-Hopf equation Manitius [20] 
had previously derived for the feedback kernel via a maximum principle. 

In Section 5 two algorithms representing implementation, of the basic 
approximation scheme of Section 4 are derived. In the time-invariant case a fast 
algorithm is derived by exploiting the near Toeplitz structure of the system of 
equations that defines the feedback kernel. A simple numerical example is also 
presented. 


2. PRELIMINARIES . Let [-r, T] denote a closed and bounded interval in the 
real line with r 2. 0 and T > 0, and let denote the class of Borel subsets of 
[-r, T]. For an arbitrary Banach space T, lyi will denote the norm of an 
element y e Y, B(Y» Z) will denote the space of bounded linear maps from Y 
into another Banach space Z, and for brevity we write B(Y) for B(Y,Y). 
Subscripts will sometimes be attached to the norm of an element to remove any 
ambiguities that might arise due to the fact that several different topologies 
will be used in the report. The notation A* (respectively AO will be used to 
denote the adjoint (transpose) of an operator (matrix). 

In the sequel the Banach space of continuous functions C([-r, T], R^) 
will be denoted I, the Hilbert space Lj (I - ** T]» R®*) will be denoted U, and H 
will denote the Hilbert space Ljd-r.T], R^). Now define the resolution of 
the identity E: — *■ B(U) by multiplication by the characteristic function, 

i.e. [E(m)uHt) » X(») (t)u(t) (X(«t)(t) * 0 if t 4 *»# X(«)(t)-1 if t s «), and 
let P* denote the family of projections E([-r, t]). The complementary family. 
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I - P*, will be denoted P t . Note that P* is strongly continuous, 
i.e. t — > P*u i« continnons for each uD. 


In tbis section we shall review some of the results in [24] pertaining to 
the linear regulator problem with dynamics 


i(t) 



n(t,0) x (t + e) + <BP 0 u)<t> 


x(t) - d(t) 


t > 0 

t e [-r.O], 


( 2 . 1 ) 


and quadratic cost functional. 


J (u,x) 



Q(s ) x (s)> dp (s) 



|u(s) | 2 ds. 


( 2 . 2 ) 


In (2.1). we assume that d(t) is continuous, x(*) a X, ueU, BeB(U.H) and 
P 0 is the projection E([0. T]). (It would not affect subsequent convergence 
analysis of the feedback kernels to allow an arbitrary projection P t , t 0 e [- 
r, T], in place of P 0 . The choice t e ■ 0 reflects the problem formulation in 
which control policies cannot be implemented until time t “ 0.) The only 
constraint we impose on B at this tine is that it be causal, i.e., for each 
te[**r,T], if Uj ■ uj a.e. on [0,t] then (Bu^Hs) * (Bu 2 )(s) for a.e. s £ t. 
The matrix valued function q is assumed measurable on 1x1 and is 
normalized so that i)(t,0) * 0 for 0 2. 0 and n(t,0) ■ n(t,-r) for 0 £ - 
r. It is further assumed that i)(t,*) is left continuous for each t and there 
exists a function m e L^(O.T) such that 

IVar n(t,«)l £ m(t) (2.3) 

where |*| denotes any matrix norm. In the cost (2.2), p denotes an arbitrary 
positive regular Borel measure on [-r,T], and Q(*) is Borel measurable with 
Q(s) 0 p - a.e. s and is p - essentially bounded. 
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It is convenient st this time to introduce some operators associated with 
the optimization problem (2.1) - (2.2). Ve define: 


0 

t 0 


LeB(X)- Lx:t 


1 f f d e n(*»®) 

J o 4 


te[-r,0] 

x (s + 0)ds t l 0, (2.4) 


FeB(TJ.X); Fu:t — *- J F(t,s) u(s)ds. 

-r 


(2.5) 


and the adjoint-like F^, 


F*eB(X.U); F^xst — *► Jp'(«,t)Q(«) x(s) dji(s). 


( 2 . 6 ) 


where F(t,s) “ [P 0 B*Y'(t,*)] *(s) and Y(t,s) is the fundamental matrix solution 
of the homogeneous problem (see [12]). Y(*,*) satisfies the Volterra equation 


t 

I - I Y(t,o)n(cr,s “ e)do 
Y(t,s) - J s 
0 


. < t 


s > t. 


(2.7) 


and the solution to (2.1) can be realized as 


0 t 

x(t) * Y(t,0) 4 (0) + f dp f Y(t,o)i\(c, (J-c)do d(p) (2.8) 

-r v 


J- 


+ I Y(t,o) (BP 0 n)(o)do. 
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Also, sop |Y(t,s)l < •, T(t.s) is absolutely continuous for t s for 
etch s, and Y(t,*) is of bounded variation for each t. Ve note froai the 
definition of F(t,s) that F(t,s) - 0 for s < 0. Hence, FP 0 « F and ? 0 F*^ F* 
Using a completing the squares argument, the open loop control law for 
(2.1) - (2.2) can be easily derived in terms of the operators defined above. 

Theorem 2.1. The optimal control u for the regulator problem (2.1) - (2.2) is 

u - U4 


where MeB(X,U), del. 


M - -(1 + F#F)“lF#(I - L)" 1 . 


4(t) 


4(0) t> 0 


t d(t) te[-r,0]. 


Proof [24], 

In [24] the feedback control law for (2.1) - (2.2) was derived from the 
open loop control above using a factorization approach. In an effort to 
motivate this approach and make the report somewhat more self-contained, we 
will briefly retrace some of the steps in [24] leading to the feedback law. 

To sketch how the factorization ideas arise in the feedback control 
synthesis, we will consider for convenience the case p * Lebesgue measure, 
Q(*) >1, U *» H, and B * the identity operator. Using (2.7) we can then write 
(2.1) - (2.2) as 

min <x,x> + <u,n> (2.9) 
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subject to 


* - f + To (2.10) 

where <•,*> is the inner product on U (we can also tafce I ■ U in this 
formulation), T e B(U) is the Hilbert-Schmidt operator 


h- 


Tu: t — ► | T(t, s)u(s)ds, 
o 


and f is the forcing tern in ( 2 . 8 ) resulting from the initial condition 4, 


o_ t 


f(t) - T(t, 0)+d (0) H T (t *cr) r\ (or, p-o)do 6 (p). 


-r o 


Standard arguments then give the optimal control solution u as 
u - -(I + T*T) -1 T*f . 

If we now consider (2.9) - (2.10) with the modification that u e PjJO, 
t 0, and replace f by an arbitrary forcing term f^« the solution to this 
modified problem is realized as 

u t - -(I + P t T*TP t )" 1 P t T*f t . 


Using a principle of optimality (see [24]) it can be shown that the choice 
f t » P t [TP t fi + f] in the modified problem leads to u t - P t « for each t. Thus, 
given a partition of I~r,T], -r ■ t Q < t* < ••• t n - T, it follows that 


A 

u 


53 BU.) Cl + p T*TP t } _1 P t T*f t , 
i t ± t ± t A 


( 2 . 11 ) 


“ [tj, tj+j]. Next, examining the term P t T*f t t it is evident that its 

values are determined by P t f t< . And for s * ti» 

± *1 ” x 
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f«l.) 


I(«.0) 4 (0) + 



(a,p-v)da) 4 (p) 


■S 


Y(s,a)u(a) 


But the above is recognized ss the solution to 

o 

*<*) “ J d e n(s,6)x(s + 9), i 1 t 

-r 

x(s) ■ x(s) t-r 1 s 1 t. 

There x(s) is the optiasl trajectory. The variation of constants formula then 
inplies for s 2. tj, 

*i • 

(a) ■ T(a,t£)x(t£) + J d J T(s,a) n (a,0-a)da)x(p) . 

t r r ti 

Thus the optiasl control in (2.11) at tiae t a [tj.tj+il only uses the 
state inforaation x(s), a a [t^-r.t^]. This control 1 st is sow “open-loop" 
just on each subinterval [tj, tj+^l- The liait as the aesh of the partitions 
in (2.11) tends to zero would hopefully produce the feedback solution. This is 
indeed the case (cf [24]), and although we will not go into all of the 
specifics regarding the liaiting procedure, we will briefly discuss the very 
auch related notion of the projection integral ([11]). 

Using the saae notations as before, let G: [-r.T] — ► B(U). Assume that 
G is strongly continuous, i.e. t — ► G(t)u is continuous for each u s U. Now 
let I a B(U) be a Hilbert-Scha idt operator and consider Rieaann subs of the 
fora 
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2E(«^)KG(tj) , -r * t 0 < ** * < t n ■ T, 


tt i * Iti»t£+il# end s These sums can be shown to converge in the 

operator norm as the mesh of the partitions tends to zero [11], This limit 
is represented by the projection integral 


/< 


dEKG(t), 


( 2 . 12 ) 


and it can further be shown [11] that 


I 1 lll HS snp |G(t)l. 


(2.13) 


Here I’Ihs denotes the Hilbert-Schmidt norm. 

Two important projection integrals on the space CfC of Hilbert-Schmidt 
maps in B(U) are obtained from the selections G+(t) = P t and G (t) » P t in 
(2.12). The resulting mappings p + , 

p + (K) - fdEZGJLt) 

are bounded projections on CfC. II I i B(p ± ) »• say that K is causal 
(anticausal). The elements of R(p + ) are quasinilpotents. In the sequel we 
shall also write K + for p ± (I). Note also that P*U and P t U are invariant 

subspaces of p_ 00, respectively. 

+ 

Ve note that in the space U a Hilbert-Schmidt map K is necessarily an 
integral operator with kernel, say K(t,s). In this case p + (K) are simply the 
Vol terra operators 


p + (R)u: t — ► J K(t, s)u(s)ds, 
-r 


S 


P (K)u: t — ► / I(t, s)u(s)ds. 
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With this bit of background ve can now state the basic factorization 
results that will be used in the sequel. 

Theorem 2.2. (Gohberg-Krein). Let X e CfC. Then there exist unique 
operators X ± e R(p + ) such that 

I + I - (X + X )(I + X+) (2.14) 

if and only if (I + P t XP t ) is invertible for each t e [-r,T], Furthermore, f 
« (I + I )“* - I is given by the projection integral 

¥ « - 

with 

G(t) - P t (I + P^t)" 1 

The decomposition of (I + X) into the product in (2.14) is called the 
Volterra or special (right) factorization of I + X, and ve will sometimes 
refer to X + as the causal (anticausal) factor of K. Note that uniqueness of 
the factorization implies that X ** (X + )* when X is self-adjoint. 

Two results that will be useful in subsequent convergence analysis are 
stated as corollaries below. 

Corollary 2.3. Suppose X i & is self-adjoint with 

sup \ (I + P t XP t ) _1 l 1 k. 
t 

Then I + X has the factorization (2.14) and ¥ + « (I + X + )~*- 1 satisfy 

•"JhS-S kl*lHS- 

Corollary 2.4. Let 1^ « Cf{, i m 1,2 bo self-adjoint with 

sup | (I + P^P*) -1 ! 1 k j* 
t 
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Let f J - (I + Xj)" 1 “ I where lj denote* 
the inti causal factor of Then 

1*1 ~ **2^HS — * in k i 1*1 ” K 2^HS + k j 
Proof. Let Gj(t) ■ P t (I + P t KiP t ) *• Then 

1*1 ” *2lHS - 1 f^l^l " G2>IhS + I f d E <*1 " ^X^lHS 

1 IXl I hS 1*1 “ *2 tas k l k 2 + 1*1 “ K 2*HS k 2 
“ *2^*1 “ k 2^HS + 1*1 Us k l^ 

Symmetry of the argument with respect to Xi and Kj ** ves result.// 

Returning now to the control solution in (2.11), it can be shown that 
the Riemann sums 

SeU,) {I + P t T*TP t ) -1 P t T* 

ill 

converge to the projection integral 
^dEd + X*) -1 T* 

where X is the causal factor of T*T, i.e. 

I + T*T « (I + X*)(I + X). 

Now given an element h e Ljd-r.Tl.U) it is further possible (see [23]) to 
attach meaning to the expression 


as a limit of sums 


E E(« i#n )d + xWh i#n 

i«l 

where the sequence of simple functions 
h n -ExUj , n )(t)h i#n 

converges in I» 2 ([-r,T],U) to h. The expression (2.15) is precisely u when 
h(t) is chosen via the principle of optimality* h(t) ** P t [TP*u + f] (cf 
( 2 . 11 )). 

Now we proceed to define the feedback solution to (2.1) - (2.2). This 
follows from a verification of hypotheses and the evaluation of (2.15), with h 
chosen above. Complete details can be found in [24]. 

Define the Hilbert space H^ - L 2 ([-r.T], R**/ p) as the space of 
p - square integrable functions on [-r.T] with. values in R^. It is evident 
that the map Q: defined by (Qx)(t) ■ Q(t) x(t) is bounded and that 

has the representation * F*Q where F* is the adjoint of F considered as 
a mapping in B(U.H||)« Hence, F^F “ F*QF ^ 0. In [24] it is verified that 
F^F is Hilbert-Schmidt, so that Theorem 2.2 implies that (I + F^F) has the 
factorization 

I + F*F « (I + X*)(I + X) (2.16) 

with X cauaal. 

Now let ¥* « (I + X*)“* - I. Since W* is Hilbert-Schmidt it has a 
matrix kernel W_(t,s). Next define the M x N matrix valued function P(t,a) on 
[-r,T] x [-r.T] by 
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P(t,a) 


(2.17) 


/• 


Kt, *)Y (s,a)dp(s ) . 



where 


Kt.s) 


F' (s. t)Q(s) + 




o) F' (s.o)Q(s )da 


(2.18) 


and T(t.s) defined it in (2.7). The function P(t # a) provides the feedback 
solution to the regulator problem. This is made precise in the following. 


Theorem 2.5. The optimal feedback control for (2.1) - (2.2) is given by 


min(T. t+r) 


A 

u(t) 


-P(t.t) x(t)- 




'/ 

t-r 


P(t # a) I dp n(a»p-ti) x(p)da 



P(t.a) (BP t u)(o)da 


A . 

where x denotes the optimal trajectory, and for each t. P denotes the 
projection on U, (P t u)(s) * X [-r, t](s)u(s). Furthermore. P(t,a) is square 
integrable (Lebesgue measure) on both the diagonal and the square [-r.T] x 
[-r.T]. 


3. CONVERGENCE RESULTS . The specific optimization problem we shall be 
considering is the following: 
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(3.1) 


■in J (u,x) 
u.x 


<x(T) , Q 0 x(T)> + 


/ 

— T* 


<x(s), Q(s) x(»)> + |u(s)| 2 ds 


subject to the constraint 


x(t) - J d$ n(t,0) x (t + 0) + (BP 0 n)(t) t 1 0 
-r 


(3.2) 


x (t ) «d(t) 


t e [-r,0]. 


(3.3) 


▼here d(*) e C([-r,0], H**) end 


(Bn) 


(t) « 2 xtrj-r.T] (t)B i (t)n(t-r i ) + j B(t.B)u(B)d0. (3.4) 

i-0 *t-r 


The assumptions on q(*,*) are the sane as in the preceding section. We 
shall assume that 0 - r Q > - ri > ... > -r k « - r and sup iB^t)! - b £ < •* sup 
lB(t,B)l - b < -. In the cost (3.1) we impose continuity on Q(s). 

Interpreting these assumptions in the context of Section 2, we 
have ji * X + 8 where X denotes Lebesgue measure* 6 is the Dirac measure with 
support on (T), and Q(*) is uniformly continuous on [-r,T) with Q(T) * Q 0 . 


Now consider the following sequence (J n ) of approximations to the cost 
J(u,x): 


J n (u,x) 


T 

J <x(s), Q(s) x(s) > d|i n (s) 


T 

+ y lu(s)l 2 ds* 
-r 


(3.5) 
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where j # a sequence of positive regular Borel measures inch that: 


HI. |» n {T} “ 1 for all n, 

H2. Given a > 0 there exists m such that n * implies 
l|i & [a,b) - lb-all < a for all a < b, a,b a [~r,T]. 

In this section we will discnss the convergence properties of the 
solutions and feedback lavs corresponding to the coat approximations above. 
Henceforth we refer to the optimization problem with cost (3.1) as problem & , 
and the problem with cost (3.5) as problem ^* n . Unless otherwise noted, 
subscripts appearing on operators, functions, etc. (e.g. F^) will indicate 
that these terms are associated with problem 

Ve begin with the following simple result. 

Lemma 3.1. Let |t be defined as above and let satisfy HI and H2. Then 
ji^-5 — ► X (Lebesgue measure) in the w* topology of C*(-r,T). 

Proof. Choose a > 0 and let fa C(-r,T). Let n * C t ^ * partition 
of [-r,T] such that for It - si < Ixl, I f ( t ) - f (s ) I < a. Then, 

T n-1 

I j f d(n n - 6) - ^ f < t i>»*n lH.ti+l>l < en n [-r.T], 

-r i-0 

and 

T n-1 

I / £ d * “2 f(t i )(t i+l * t 4 )l < a(T+r) . 

-r i-0 


.16 


By H2, sup ! |i j [-r,T)| ■ k < •. Now choose m such that a' > m implies 
lp B ’[a,b) ~ la-bll < e/nlfl. Then it follows that 


n-1 

I ^ {p B » [t£,t i+1 ) “ (t£+l - t£ ) } I < S. 

i-0 


Hence* 



<*Sa* “ 6) 



< s + 2ke. 


And the lemma is proved.// 


Since by definition F(t.s) - [P 0 B*l'(t,*)] '(s) (cf (2.5 )-(2.6) ) , 

from (3.4) it follows 


k 

F(t, s) « 2 (t.s) 

i**0 


where 

****** \ e ■ * / # v 

f 0 (t,s)« T(t.s)B 0 (s)+ J T(t,B)B(e.s)d6, 

s 


(3.6) 


and 


fi(t.s) - Y(t.s+r i )B i (s+r i ) , i*l,...k. 


Now let y “ sup |Y(t,s)l. Then using [12, p.149] and the bounds in (3.4) 

we have 

*op l* 0 (t,s)| lr(b 0 + hr), sup If i(t. m) I £ b iT . (3.7) 

t,s t,s 
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(3.8) 


Also, for t 2 > *1 1 • + r i' 


*2 

lf 0 (t 2 »«) ” l“ll J »(<y)d» 0> o + brJ 

*1 

+ by(t 2 - t 2 ). 


sad 

H 

lfj(t 2 .s) - £ bi exp / m(o)do. i-1, ... k. (3.9) 

*1 

It is not difficult to show (see [24]) that F* « F*j , fJ “ F*j n where F 
is the B-space adjoint of F, i.e. F*: X* — ► U. and j, j n are the mappings of 
X into X*. 

j (x)y - f<y(* ).Q(s) x(s)> dji(s), 

* /< y< s). Q(s) x(s)> dp n (*)« 

Now it follows easily fron definition and the estimates above that F is 
compact. Thus, using the w*- convergence of j„(x) - * j(*) * or «*ch x (from 
Lemma 3.1). it can then be deduced that F^[ — *• F# strongly. Consequently from 
the compactness of F it also follows that F^F — ► F^F uniformly. Noting the 
form of the open loop control law (in Theorem 2.1), these general 
considerations are enough to demonstrate the L 2 ~ convergence of the 
approximate optimal controls and the uniform convergence of the corresponding 
optimal trajectories resulting from approximations based on However, the 

major aim of this section is to produce the stronger L. - convergence of the 
approximations for the feedback kernels as well as the controls, and this 
requires a somewhat more specific analysis. 

Let Z denote the space L„( [-r,T] .R® 1 ). From the definition of F(t,s) it 
is evident that F^ and f£ are also in B(X,Z). Our first result sharpens the 
convergence of F^[ — > discussed above. This result (and the method of 
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proof) will fora the basis for the - convergence argnaents later. 

Lemma 3.2. F^[ — ► strongly in B(X.Z). 

Proof. Let z a X. By definition 


k 

(F^-f£]x: t — ► ^*"1 Jf I(s, t)Q(s)i(s)d(u~H«) (s) . 

i-0 


For each i define 


fjU.t) 


Similarly define 


QjU.t) 


and 

xj(*.t) 


f i(*,t) 

fjtt+rj.t) 

Q(s) 

Q(t+rj) 

z(s) 

z(t+r i ) 


s > t + ri 
s i. t+rj. 

a > t+rj 
* 1 t+r i» 

s > t+r^ 
a lt+rj. 


Considered as families of functions parameterized by t, {fj(*»t)) is 
eqnicontinnons by virtue of (3.7)— (3.9). and (Q^(*.t)} and {ij(*,t)) are 
eqnicontinnons by virtue of tbe uniform continuity of Q(*) and x(*), 
respectively. Furthermore these families are clearly uniformly bounded. 

Hence the set 


S - (x t eCU-r.T] f 1 M ): x t (s) 


f *(s. t)Q^(s. t)x^(s, t),t al-r.T-r^] , 


i«0,l,...k} 
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is relatively compact in C([-r,T] ,R M ) . Now note that 


k 

I IF* - F{](x)(t)| < £{| f f i (s.t)Q i (s,t)x i (s.t)d(p-|i n )(s)l 

i°0 [— r,T] 

+ I J' f t)d(n~it n ) (s)|j 

I-r, t+r^l 


Using the compactness of S it follows from Lemma 3.1 that the first 
integral above converges to zero uniformly with respect to t. And since the 
second integral has constant integrand for each t and i. uniform convergence 
is obtained here by nsing 02.// 


Now let H(t.s) and H n (t,s) denote the kernels of F^F and F^F respectively. 
Fubinis theorem implies 


|H(t.s) - ^(t.s)l 


•2 f S i {a , t)Q(cr)fj (e.sMtp-jij 

l.J 


)(<x)l. (3.10) 


Arguing as in 
uniformly to H(t»s). 


the previous lemma we can show that H n (t,s) converges 
To see this note that for each fixed i and j 



(o,t)Q(o)fj(o,s)d((i-|i n ) (o) 



[s+rj,T] 

[t+ri#T] 


f i(o,t)Q(<r)fj (o.sJdtp-iijj) (o) 
fi(o,t)Q(o)f j (o,s)d(p-p n ) (o) 


s+rjit+ri 
s + rj<t+rj 


Thus on the set V * {(s,t): s + rj 


^ t + rj) we can write 



t)Q(o)fj (o,s)d(|i**|i n ) (e) 


I 

l-r.T] 


'Vp / 'V 

f j(o,s,t)Qj(o,s)fj (o,s)d(|i-|i n ) (a) 


- / 

[-r, s+rjl 


fj(o, s, t)Qj (<r, s)fj (a, s)d(n“n n ) (a) 
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▼here Qj(a ( g) and f j (a, s) are defined a* in Lemma 3.2 and 

f i(ff.t) a 1 a+rj 

f^O.S.t) '» 

fi(s+rj,t) a < a+rj . 

Uniform convergence of the integrala parameterized by V follow* in the 
manner of the proof of the lemma. Since thi* argument alto hold* for 
anbaeta of the form {(s.t): • + r j < t + rj)» we obtain the following 

reanlt. 

Lemma 3.3. Vith the notationa above. H n (t.s) — ► H(t.s) uniformly on 
J-t.T] z [-r.T]. In particular. F^F — ► F^F in the Hilbert-Schmidt topology 
(in B(U)) and in the B(Z) topology. 

To obtain L» - convergence of the feedback kernel* we will ute a result 
analogous to the one above regarding the convergence of the kernels of the 
Volterra factor* of (I + F^f). Already we can use Corollary 2.4 and Lemma 
3.3- to obtain Hilbert~Schmidt convergence of the factors (hence. Lj ~ 
convergence of their kernels). But our ultimate interest is to demonstrate 

* -trC-gCT — wuuv 6i|?ace ui ute Iciuci »« "u us o u tLc ivi iuw 1U5 tw w 

results which are of some interest in their own right. 

Proposition 3.4. Let K be an integral operator on U with essentially 

bounded kernel Z(t»s). esssup lz(t.s)l ■ fi < •. Suppose that 

t,s 

sup I (i+p t n» t )-i | « o < • 
t 

so that (I + X) has the factorization 

X + K - (X ♦ X*)(X + X) (3.12) 

with X causal (cf Theorem 2.2). Then V ■ (1+ I)~* - I and V* are integral 
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operators with kernels W + (t,s) satisfying the bound 

esssop |V (t,s)l < p(o,p) (3.13) 

t,s i 

where 


P<o.p) - pU + p (T + r) (1 + op (T + r)) 2 ]. 

Proof. First note that Corollary 2.3 iaplies T * (I + X)“ 2 - I is Hilbert 
Schmidt with l* IjS — a ^HS* Now the factorization (3.12) iaplies 

(I + X*) - (I + X) (I + W). 

Snbstracting the identity froa the above and applying the projection p_ (i.e. 
taking anticansal parts) results in for 0 2 t, 

X_(t,9) - X(t.«) + 

e 

where X_(t,9) and f + (t,9) are the kernels of X* and W respectively. 

Hence for a.e. t,9. 


y 


X(t,o)T + (<x,0)da a.e. t,0. 


IX_(t,«)| £ p{ 1 + 


/*♦ 

-r 


(o,6) Ido) . 


Consequent ly. 


T T T 

J lx_(t.e)|2d« £ p 2 f u + f lw + (0.e)ldol 2 d« 

-r -r -r 
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Let a 
identity W* 


T T 


T T 


P i 2 * {(T + r) + 2 // |W + (o,0) Idod0 + J r ^ylt + (ff,0)|d a 2 d0 


-r -r 


-r -r 


1 P 2 (T ♦ x) (1 + |W + I HS ) 2 

1 P 2 (T + x) (1 + a |lI HS ) 2 

£ p 2 (T + r) [1 +«p(T + x)] 2 e.e. t. 

P 2 (T + r) [1 + ap(T + x)] 2 . Then since 1* satisfies the 


-X* ~ X*f *, we have 


ll_(t.0)l £ | X_(t,0) I + J |X_(t,e) f_(«x,0)ldo a.e. t,«. 

-x 


Hence* 


T T T T 

J lw_(t.e)l 2 de £ (t,0)| 2 d« + 2 / lx - (t.e)| J"- (t.s)l |W_(s.e)|dsd0 

-x -x -x -x 


T T 


f [j lx_(t,s)l lw_(s,e)lds] 2 d« 


-r -r 


£ « ♦ 2a lfJ HS + a lw_lg s 
£ o( 1 + alxl HS ) 2 a.e. t 


Thus, 


i 

• assnp y* lf_(t.©)| 2 d« £ p 2 (T + x) I 1 + «p(T + x)] 4 . 
-x 


(3.14) 


Now (3.12) also implies 


I ♦ X - (I + I*)(X ♦ I). 
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Subtracting the identity and applying yields 


1* + X_+[l*lJ- 0. 


Hence* 


lw_(t.e)l £ IX(t.8)| + 


fa 

-r 


<t*s)llx(s.8)lds. 


£ 0{1 + (T + r) 1/2 ess snp 

t 



|W_(t.s)l 2 ds] 1/2 } 


The result follows from 3.14.// 

As the result above may be regarded as the L„ analogue of Corollary 2.3* 
the next proposition is the L s analogue of Corollary 2.4. The notations X, T. 
X± (t*s) and W + (t,s) will have the same meaning below as in the preceding 
proposition. 

Proposition 3.5. Let X be as in the proposition above and let (K n ) denote 
a sequence of integral operators on U with essentially bounded kernels K^U.s) 
such that 


and 


ess sup lE^U.s)! £ 6. 

t,s 


lim ess sup ll n (t.s) - I(t*s)l - 0 

n t,s 

Assume further that Xn 1 0 for each n so that I + X n has the factorization 
I + Xjj - (I + lj)(l + x n ) (with Xn causal). 

Let W n (t,s) denote the kernel of the integral operator (I + X^ -1 “ 
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Then 


lin ess snp lw a (t,s) - ¥+(t,s)| « 0. 

n t,s 

Proof. First write I + I* - I + I + (K n - *> »o that 

I + In - (I + X_) (I + A^) (I + I + ) (3.15) 

where A* - (I + - D (I + X)" 1 . By Proposition 3.4 ess snp 

- - * ti j S 

|1 ± (t.s)ll o for soae o < • . Now since I + K a has the factorization, so 
does I + A n . Specifically, I + A n - (I + ij) (I + T n ) where I + T n - 
(I + I n )(I + T). It then follows from the identity 
I + P t A n P t - (I + P t TlP t )(I + P t Y n P t ) 

that 

snp 1(1 + P^Pj)” 1 ! £ snp 1(1 + P t T n P t ) _1 | 2 
C L (1 + IwJ) 2 anpl(I + PjWPt)” 1 ! 2 

- 11 + l K nlHSl 2 t«Pf 1/2 < 1 + 3 2 

Here we have nsed Corollary 2J to obtain the first tern in the prodnct, 
and the fact that 1 is Hilbert-Schaidt and qnasinilpotent together with [7 ,p. 
1039] and Corollary 2.3 to obtain the second tern. Now define Z„ “ 
(I + Y n )“ 2 - I and let Z n (t,s) denote its kernel. Then since 

ess snp lA a (t,s)l£ P n [l + ad + r) + a 2 d + r) 2 ] 
t ,s 

where P n « ess snp |K(t,s) - K n (t,i)l and a L ess snp |f ± (t,s)|, 

t,s t,s 

Proposition 3.4 iaplies 

ess snp Z a ( t ,s) £ p<«^, p a ) 

t > S 
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with 


0 n - 0 n tl + o(T + r) + « 2 (T+ r) 2 ] 

•ad 

a n - [1 ♦ |l n l HS ] 2 [exp{l/2(l + Ill|s>] 2 * 

Finally, note that 

*n " * + 2 n + ^n* 

•o that 

ess snp lt n (t,s) - I+(t,s)l 1 p(a n ,0„)[l + (T + r)o] . 

t,s 

<\j 

Bat p(o n , p n ) « 0 (p n ) . This completes the proof.// 

Before proceeding to the main result of the section we will make a 
digression to establish convergence of the approximate control sequence and 
trajectories. Again we let Z denote the space L^d-r.Tj.R^). For notationsl 
convenience we also introduce the subspace X Q C ^ initial conditions. 

X 0 « {x e X: x(t) * x(0) for t 2. 01 

and endow it with the subspace topology. 

Theorem 3.6. Let & denote the unit ball in C ([-r.Ol.R^). For i let 
u n (d) and u(d) denote the optimal controls for problems *od & respectively. 

A A 

Also let x n (4) and x(4) denote the corresponding trajectories. Then* 
uniformly on 3$, 

(i) lim |i n (d) - u(4)\ z - 0. 

(ii) lim l£ a (d) - £(d)lx “ 0. 
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Proof. Let 4 t3i. The ores 2 J (ires the opt Inal control law aa 


^(d) - -(I + FfF) _1 Fj(I - L)" 1 ? 

where d a X Q ia the extension of 4 anch that d( t) ■ 4(0) for t2. 0* 

Now (I — L)~*d is recognized as the solution to the homogeneous problem 


x(t) 



T\ ( t » 0) x (t + 6) 


-r 


with initial condition 4. Denoting this solution x(d), standard arguments 
(see [12]) giwe 

|x(d)(t)l £ Idlexp Imlj (3.16) 

and 


lx(d)(t 2 ) 


xWXtj)! 1 Idl 



explm!^. 


(3.17) 


Thus the set S, 


S - {x t C ([ 0,T ] , EH). x(t) . [(I _ L)"ld](t) for 4 t X 0 with Idl < 1} 


is relatively compact in C([0,T] ,R N ). Now, the triangle inequality yields 

•®n “ «lz - 1 (I + f Jf)“ 1 I B (2) * l<Fj - F*)(I - L)" 1 ?^ (3.18) 

♦ Id ♦ fJf)” 1 - (I + F # F)" 1 | b(z) * |F*(I - D^dlz 

Let 6 2. ess aup H n (t,s) for all n (cf (3.10) and Lemma 3.3). 

t,s 
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Also let W n ■ (I + X^ -1 “ 1 »here X^ i* the causal factor of F^F, i.e. 


(I ♦ F#f) - (I ♦ ij) (I + X n ) . 


Thus. 


U+fJf)” 1 - (I + l n )(I + wj). 


Nov let V n (t.s) denote the ternel of V n . Then Proposition 3.4 
IV n (t. s) I £ p(1.5) independently of n. Hence. 


ia>pl ies ess sup 

t,s 


Id + fJfJ^Ibcz) £[1 + p(l,6) (T + r)] 2 . 


(3.19) 


mad consequently 


I (I+Fj F r 1 -(I+F # F) _ 1 l B(z) <| (I+FfF) _ 1 l B(z) |F # n F-F # Fl B(z) | (I+F # F) _1 I b(z) 

£ [1 + p(l. 6)(T + t)] 4 |fJf - F # Fl B(z) (3.20) 


Ve also have from (3.16), 

|F*(I - L)" 1 7 l z £ Idl If # I b(1>z) • exp Iml} (3.21) 

By Leaaa 3.2. F^ — *> F^ strongly in B(X, Z). Recall now that F(t,s) (the 
kernel of F) vanishes for s < 0. Then since S is relatively compact, it 
follows that (F^ ~ F^)(I _ L)“ 2 — ► 0 uniformly in B(X 0 *Z). This result and 
Lemma 3.3 together with (3.1 9) - (3.21 ) inserted into (3J.8) proves the first 
assertion of the theorem. 


To prove the second part let y ft ( 4 ) - x n (d) - x(d). Then y n (d) satisfies 
(3.2) with xero initial condition and forcing term B(u n “®). Therefore for some 
constant C. 


Iy n (d)(t)l < C ess sup lu_(d)(t) - n(d)(t)l. 

t 
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Urns (ii) follows from (i ).// 


Next we present the convergence properties of the sequence of 
•pproxinating feedback kernels. 

Theorem 3.7. Let P n (t»a) and P(t.a) denote the feedback kernels of 
Theorem 2.5 associated with problems^ and SP respectively. Then. 

(i) lim ess t sup |P B (t.t) - P(t,t)l ■ 0, 

(ii) lim ess sup |P (t.o) - P(t.o)l ■ 0. 

n t,a “ 

Proof. From (2.17) - (2J.8) we write 


k T 


(t.a) * 2/ K n#i (t.s)I(s. 


a)dp n (s ) . 


i»0 a 


k T 


P(t,a) * y KjCt. s)Y(s.«)d(i(s), 


i«0 a 


where 


i-fi 


K n,i (t ** ) “ fj(s,t)Q(s) 


+ f f n (t,e)f j(s,e)Q(s)d6. 

•'t 


and 


K i (t.s) « f i (s.t)Q(s) + 


*” r i 

f f(t.e)fj(s. 


e)Q(s)de. 
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Thus, 



|P n (t.a) - P(t.o)l 1 £ tl y I K n.i (t » #) " ^(t. *)]!(•, aJd^ts)! 


i-0 a 


1 


s)Y(s,a)d(|i - tt.) (*)!} (3.22) 


*” r i 

l r n.i (t **) * *i<t.t)l 1 y* lw n (t,e) - w(t,e)l If •(«.©) I Iq(*)|, 


Lemma 3J and Proposition 3.5 imply that 


lim ess snp If (t,0) - f(t,0)l - 0. 

n t,6 n 


And since |fj(*,0)l and I Q(s ) I are uniformly bounded, routine arguments yield 
a measurable set Q C [-r,T] whose complement has zero Lebesgue measure such 


K B ^£(t,s) — ► K^(t,s) uniformly on 0 z [-r,T]. 

Using the uniform boundedness of Y(s,a) and the sequence of measures 
it follows that the first integral in (3.22) tends to zero uniformly on 0 z 
[~r,T]. To prove convergence of the second integral in (3.22) we argue as in 
Lemma 3.2. Define the family of functions parameterized by a and 0, 

{Y( * ,o,0)} , 


Y(s,a,0) 


Y(s,a) s > 0 


Y(0,a) s < 


3 


and the family of fnnctiona parameterized by t and y, 


/ *i(t,s) * 1 T 


*i(t,y, a) 


* < T 


It ia straightforward to verify asing the properties of K^(t,s) and Y(s,a) 
that the set 


* )Y(* ,a,p) : p * max {a.t+rj}} 

is relatively compact in C (t-r,T], R MxN ). Thns the argument in Lemma 3.2 
applies here to demonstrate that 

T 

lim ess sup I / K i (t,s)Y(s,u)d(p n -p) (s)l -0. 
n t',a *' n 

a 

And the theorem is proved.// 


4. APPLICATIONS . In this section ve begin by deriving the optimal feedback 
kernel associated with an arbitrary discrete state cost penalty. It will be 
evident that given the fundamental matrix Y(t,s), the feedback kernel in this 
case can be derived by quadrature and matrix inversion. This feedback 
structure when combined with the results of the preceding section leads to 
approximations to the optimal feedback kernel of problem^ (recall (3.1)- 
(3.3)), a Viener-Hopf characterization of this kernel, and a priori bounds on 
its magnitude. 


Let v denote a positive discrete measure on [-r,T] of the form 



n-1 

^(*n) ajftsj),' S£et“r,T] , Sg 

i-0 


T. 
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Inserting this netsnre into (2.2) results in the cost 

n-1 T 

J (u,x) = <x(T), Q(T) x(T)>+E». ^(sJ, Q(s.) x (s^ + f hi(s)| 2 ds (4 * 1} 

i=0 -r 

The optimal feedback kernel for this cost has the following seai separable 1 
structure. 

Theorem 4.1. Define the matrix functions G(t) and Y(t)» 


G(t) « [^ 0 F'(s 0 .t)* : '£^F , (s 1 I _ 1 ,t) j F'(s n »t)]' 

T(t) - I^ 0 T'(. 0 ,t): (.*_!, t>:Y'<. n .t>]'. 


Also define the matrix 


«<*o> 


PS* 

Q 


L Q( *-’ J 

Then the optimal feedback kernel P(t.a) for the problem with dynamics (2.1) 
and cost (4.1) can' be expressed as 


P(t,o) - G r (t)Q(I + D(t)) _1 Y(a) 


(4.2) 


where 


U(t) 



G(s )G* (a )Qda. 


t 


(4.3) 
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Proof. It is straightforward to verify that the asp F#F resulting froa 
the aeasure v in (4.1) is an integral operator with separable kernel 
G’(t)QG(s). How Theorem 2.5 implies 


P(t,o) 



(t, s^)Y(s|,a) . 


Here 

Kt.s^ - F^Si.tJQCsi) + 

and W_(t,o) denotes the kernel 
factorization 

(I + F # F) _1 - (I + W) (I + 

Using the fact that F^F has 
[11. p.188] that 

W_(t,o) - -G'(t)Q[I + VU)r X G (a) 

with U(t) defined as in (4.3). Thus we can write 

P(t.o) «53 {v/ *i F '<*i»t)Q(s 1 ) - G' (t)Q[I + O(t)]" 1 

G(o)v / a£F' (sj,c)Q(s£)do}\/ajY(s£»a) 
t 



^*W_(t,o)F' (s i ,«)Q(s i )do 
t 

of the anticansal operator W* in the 


I*). 

• separable kernel, it can be verified 


Noting the definitions of G, T and U, it follows 

P(t.o) - [G* (t)Q - G’(t)Q[I + U(t)] -1 U(t)]I(o) 

- G* (t )5fcl + U(t) .// 

Note that if in (4.1) we take for i #n, a^ * 0, and let the operator B 
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denote a mul tipi ication operator, i.e. (Bn)(t) * B(t)u(t), then (4.2) reduces 
Manitius' result for terminal state penalty [20] 

T 

P(t,a) = BWYTT.t) Q(T) [I + J YfT.aiBfaiB^cOY'tT.aiQmda]"^^^). 

t 

Thus Theorem 4.1 can he viewed as an extension of this result to problems 
with control delays and arbitrary discrete state penalty. 

Now let be a sequence of discrete positive measures satisfying HI and 
H2, and let P n (t,a) denote the corresponding feedback kernel s. The theorem 
implies each P n (t,a) has the semiseparable form (4.2), while Theorem 3.7 
implies the L a convergence of P n (t,a) to P(t,a)(the optimal feedback kernel 
for problem &*) and also the L«, convergence of P n (t,t) to P(t,t). Introducing 
subscripts in the obvious way, define for each n, 

V n (t) - [I + U n (t )] -1 - I 
so that the identity 

v n (t> - -n tt (t)[i + von 1 

holds. Multiplying by 6^(t)Q n we obtain 

®;(t)v n (t) - -G;(t)v n (t)n + vor 1 
- -G' n (t>Vl + D n (t)] -1 u n (t). 

Then using the definition of U n (t) and multiplying by Y n (a) it follows that 

T 

G A(t)V n (t)Y n (o) - / Gi(t)G^[I + U n (t)]“ 1 G n (o)Gi(o)Q n doY n (o) 

* / * (4.4) 
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Froa (4.2) and the definition of V n (t) ve have for # 1 t 
P n (t.a) - G;(t)<^V n (t)Y n (a) - Gi(t)Q n Y n (o). 

And substituting (4.4) into the above 


P n^,o) 


qi(t)Q n Y n (a) - /^(tJ^II + U n (t)]" 1 G n (o)Gi(o)Q 11 doY ll (a) 
t (4.5) 


Now let P n denote the integrsl operstor with kernel P n (t,a). We then 
recognize <^(t)Q^[I + U n (t)l _1 G n (a), with o 1 t, ss the kernel of the operator 
[P n B]_. Also G^(t)Q n Y n (a) is recognized as the kernel of F^Y, where Y is the 
operator in B(H.X) defined 


Yu: t 



Y(t, s)u(s)ds. 


(4.6) 


Thus (4.5) represents the Wiener-Bopf equation 
P n “ - I(P n B)_FfY]_. 

Let P denote the operator with kernel P(t,a). Then since 

lia |P n - Pl HS ” 0 and lin IfJy - F#Y| hs ■ 0 

n n n na 

(where the latter is essentially Lemma 3 J), by continuity of the projection 
p_ on the space of Hilbert-Schmidt asps we obtain 

P - (F # Y)_ - t(PB)_F*Y]_ (4.7) 

We fozaalize this discussion in the following. 

Corollary 4.2. The Wiener-Hopf equation (4.7) has a unique Hilbert- 
Schaidt solution P. A version of the kernel of P is the optiaal feedback 
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kernel for the optimization problem <3? (3.1) - (3.2). Furthermore this version 
of the kernel can be approximated in the topology on the diagonal as veil 
as the square by the semiseparable kernels P n (t,a). 

Proof. We only need to prove the uniqueness assertion. Suppose there 
exist two Hilbert-Schmidt solutions of (4.7) and let 6 denote their 
difference. Then ve obtain 

5 + [ (5B) JP*T]_ - 0 

Clearly it is sufficient to show that (6B)_ ■ 0. Define 

6 - -[ (6B)_F#Y] 

so that 6_ ■ 6. Then since B is causal it follows that (6B)_ - (6B)_ and 

b + (TbL^T “ 0. 

Multiplying by B and noting that IB = F (cf (3.6) and (4.6)) 

(6B)_ + [(6B)_F*F]_ - 0. 

We will show that zero is the only solution to the equation 

X + [XF#F]_ - 0, (4.8) 

thus proving (6B)_ ■ 0, and the result. Now (4.8) is equivalent to 
[X(I + F#F)]_ - 0 

with X anticausal. Since F^F 2. 0* there exists a causal Hilbert-Schmidt map V 
such that 


I + F#F - (I ♦ V*) (I + V) . 
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Thus X solves (4.8) if and only if Z solves 

Z + [ZV]_ - 0. <4.9) 

(These solutions tie related by X - Z(I + V*)" 1 .) Next consider the 
■apping SC on the space of Hilbert - Schmidt operators 

i?(Z) - [ZV]_. 

Then (4.9) is equivalent to (I +SC)Z - 0. Nov by induction ve find that 
£? n (Z) - [ZV n ]_, 


so that 


|2*(Z)I £ lzl HS |v n l . 

But V is quasinilpotent. hence so is SC, Thus the only solution to (4.9) 
is Z * 0, and the theorem is proved.// 


We note that the existence portion of the corollary was proved in a 
different manner in [24]. 


When B is a multiplication operator. (PB)_ ” PB, so that (4.7) becomes 
P - (F*Y)_ - [PBF*Y]_. 

The kernel of F^Y is easily computed to be of the form B'(t)A(t.s) where 


A(t, s) 


) 

max (t, s) 


Y' (or, t)Q(a)Y(o, s)do + Y' (T. t)Q(T)Y (T. s) . 


Let A denote the operator with kernel A(t»s) and note that F^Y • B*A. 

Next consider the following modification of the Wiener-Hopf equation (4.7). 

n - A_ - 01BB*A]_ (4.10) 
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Using the sane techniques is in the proof of Corollary 4.2 it is possible to 
show that (4J.0) has a unique Hilbert-Schmidt solution Il 0 * Also note that the 
corollary implies B*I1 0 - P. He Yiener-Hopf equation (4J.0) is equivalent to 
the parameterized family of Fredholm equations Manitius [20] derived via a 
maximum principle for obtaining the feedback kernels: 

T 

n(t,s) - A(t, s) - yn(t.e)B(e)B'(e)A(e.s)d© < 4 . 11 ) 

t 


Corollary 4.2 extends this Yiener-Hopf characterization of the feedback 
kernel to problems with control delays. and simultaneously provides 
approximate solutions. 

The special factorization has been previously exploited in solving 
Yiener-Hopf equations on finite intervals of the type (4 J.0)-(4.11) that arise 
in inverse problems in the spectral theory of differential operators [8], [18]. 
and in the filtering and smoothing problems for nonstationary processes [15]. 
[16]. The corollary is in a sense a solution finding the right Yiener-Hopf 
probl em. 

Now we return to the original problem (3.1) ~ (3.3) and consider a 
specific sequence of measures for generating approximations to the optimal 
feedback kernel. 

Let {p n } denote the sequence of measures 

f “ _1 

Jt* F n - f (T) + (T + r)/n 2 f(i(T + r)/n - r). (4.12) 

i-0 

This sequence is easily shown to satisfy HI and H2. Now suppose the 
majorizing function m(t) (cf (2.3)) is bounded and the weighting function Q(*) 
has a bounded derivative. Letting P n (t,a) denote the feedback kernel 
corresponding to the cost with measure |i a < it i» straightforward (although 
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tedious) to derive a constant C from the results in Section 3 such that 


ess sup |P(t.o) - P n (t,a)| £ C/n, 

*, a 

ess sup |(P(t.t) - P n (t,t)l £ C/n. 
t 

Te can also use the approxinations P n (t,a) 
on |P(t,a)| in the following way. 

First observe that for each t 

5^(1 + U n (t)) _1 £ V n - 1.2.... 


( 4 . 13 ) 


to obtain an a priori bound 


Thus. 


|P n (t.o)l £ lG;(t)llQ tt llT n (o)l. n - 1.2.... 

where all the norns above are operator (matrix) norns on the appropriate 
Euclidean spaces. Now for each o, ll n (a) I is bounded by |l n (u)l HS . (Here 
Mbs denotes the Hilbert-Schmidt matrix norm which is the square root of the 
sum of the squares of the matrix entries.) And 

n-1 

IVa)l|« - |T(T.«)li s + (T ♦ r) S iKsi.alljs 

n i*0 

£ (T + r ♦ l)sup |T(s.a)l§ s . 

s 

A similar bound holds for 

l®£(t)| 2 £ <T ♦ r ♦ l)sup |F'(s,t)l| s . 

s 

Thus* 


|P n (t,o)| £ (T + r + 1) sup lQ(a)l • eup |F'(s,t)|g S • sup |l(s.a)|gs 

as s 
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independently of n. And since P n (t,a) — > P(t,o) a.e. (on the square and 
diagonal), the same bound holds for |P(t,a)l, 

The T- dependence (the length of the problem intervsl) in the bound for 
|P(t,a)| can be expressed differently by noting that 

T 

lim |T n ( a ) • |l(T,a)l§s+ J |l(s,a)|| s ds 

ft ^ 

a 

And after using the analogous bound on |G^(t)l we obtain 

T 

|P(t,o) | <. sup lQ(s)| • {|F(T,t)l| s + J |F' (s,t)l|s d *J 1/2 

t 

T 

• (|T(T,o)l| s + j |T(s,a)li s ds) 1/2 . 
a 

When the system (3.1) _ (3.4) is time-invariant and stable, a bound 
independent of T can be established for |P(t,a)l. To see this we take Q(*) “ 
Q ( a constant matrix) and q(t,0) “ q(O) where all the roots of the 

characteristic equation 

o 

au) - u - y* e xe dr\(e) 

-r 

are assumed to have negative real part. Also for simplicity we define the 
operator B as 

t 

(Bu) (t) - B 0 i»(t) + f B(t-«)u(e)d6 

t-r 
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with |B 0 | , sup |B(t)| < b. From tbe assumptions on qO) it can be shown U.2] 
that Y(t,a) ■ Y(t-a) with |T <t ) I £ c exp(-pt) for some c.p 2. 0. Thus, 

T • 

J * lT(s,o)|§gds £ J" |T(t)l§gds < • . 
a 0 


Since the La place transform Y(X) of Y(t) satisfies Y(X) ■ A _2 (X), (see [12]), 
the inequality above together with Parseval's formula yields 


I 


|Y< 


*»°) - 


2n 


b 


( iX) IggdX. 


Next note that 


■in(s, t+r) 


|P'(s,t)!gs — blY(s-t)Ig S + J |Y(s-e) | HS lB(e-t)l HS de 


1 blYU " t)l HS + G(*,t) 


where 


r*v l i/2 

bvF \J |Y(u ) Iggdu t t £ s £ 


t+r 


0 

G(s,t) - ^ r t+r 

bVr| y* lY(s-B) 
t 


11/2 

■Ssdel 9 • 


> t+r 


Straightforward approximations then yield 


A w 

/ IF' ( * .t)|J s d* £ 2nb 2 (l +%/2r) 2 f U” 1 ( iX) l| s dX. 
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I 


Thu*, independent of the length of the problem interval. 


|P(t,o)| < |q| 2irb(l ♦ y/2x) ( iX) I QgdX. 

5. ALGORITHMS . The results that have been presented thus far have emphasized 
the connections between factorization and the feedback kernels for hereditary systems 
with control delays, and have not focused on any of the implementation issues 
concerning the approximation scheme that has been defined. In this Section we will 
examine in greater detail algorithms based on Theorem 4.1. Particular attention will be 
paid to time-invariant systems. First, a quick remark about these algorithms in general. 

We note that it has already been observed that combining (4.2) with the 
discretizations (4.12) results in (most cases) an 0(l/n) L convergence of the feedback 

OO 

7 

kernels . This result, which is also valid for time varying systems with control delays, 
is sharper than the approximations for the feedback operators obtained in [10], [19]. 
Neither of these articles establishes a priori rates of convergence, nor do the 
convergence results that are established translate into L w convergence of the kernels. 
The underlying reason why we are able to obtain the stronger convergence is that we do 
not approximate the entire semigroup (or evolution operator), but only that piece that is 
contributed by the fundamental matrix (which we must solve for as a separate 
computation). In the general time-varying case with discrete state cost at the nodes 
{s Q }n._^, this amounts to solving the n+1 Volterra equations 

s. 

l 

Y(s^,a) = I - J Y(s.,u)ti(u,cr-u) du (5.1) 

a 

In the time-invariant case these computations reduce to the single Volterra equation 


t 

Y(t) = I - X Y(uVn(u-t)du. 
o 


(5.2) 


42 


Once we have the solutions (5.1) or (5.2), the feedback structure (4.2) is straightforward 
and can be computed from quadrature, matrix inversion and multiplication. 

Of course we are not constrained to directly solving (5.1) or (5.2), and we can use 
other methods for obtaining the fundamental solution - e. g. state approximation 
methods [2], [3], the method of steps (26], [27] or other available methods for solving 
Volterra functional differential equations [28], (29], [30]. Having said this, we assume 
throughout this section that the functions Y(t,s) and F(t,s) have been computed. (A 
couple of issues associated with these computations will be addressed later.) 

One straightforward implementation of (4.2) consists in defining the grid 
so that = A = T/n, taking a^ = A, and replacing the integral in (4.3) by a first 

order Euler quadrature with nodes {s.}. To see where this leads us, first write P(t,a) in 
the more symmetrical fashion 

P(t,a) = G’(t)Q V, [I 4 U(t)f l Q V, Y(a), 

where 

T 

U(t) « Q V * JG(s) G*(s)dsQ X/ * 
t 


A 

Let U(s.) denote the approximation to U(s.), 
1 ^ 1 


6(8^ * £ AQ V *G(s.)G , (s j )Q Vl , 


A 

and form the approximations (P(s^,s.)}. ^ ^ to (Pte^s.)} . ^ 
P(s.,s.) » G’fs^Q^lI + U(s.)f l Q V, Y(s.) 

Now note that 


(I 4 Ute^j)]" 1 » II 4 U(s.) 4 A Q V, G(s.)G '(s.)Q V *f \ 


(5.3) 
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(Recall that M * dimension of the input space.) Thus, rising the matrix identity! 


(X 4 YY*)“ l « X 1 - X' l Y (Y’X —l Y 4 D l Y‘X 1 

A _. 

for compatible matrices X and Y, it follows that (I 4 U(s. j)) can be updated from 
(1 4 U(s.))~ 1 in about 2MN 2 n 2 operations. Further, exploiting the semiseparable 

1 A 

structure of P( • , • ), a rough operation count shows that {P(s^,s.)}. ^ ^ can be computed 
in approximately 3MN 2 n* operations when M i N « n. 

Considering that there are more than MNn 2 /2 values in the matrices 

A 

{P(s^,s.)}. j this algorithm is fairly efficient. However, we will subsequently show 
that it is possible to do substantially better in the time -invariant case. (We will also 
provide a more complete analysis in this case.) 

For the remainder of this section we consider the problem defined by the dynamics 
0 k t 

x(t) » J dn (0) x (t40) 4[ B. u(t-r.) 4 J B(t-6) u(0)d0, t 2 0 (5.4) 

-r 1*0 t-r 

x(t) * 4>(t), t c l-r,0] 

u(t) * 0, t c l-r,0] 

and cost 

T 

J(u,x) » J <x(s), Qx(s)> 4 |u(s)| 2 ds (5.5) 

0 

The assumptions here are the same as in (3.1) - (3.4), except now everything is 
time- invariant. (Previously the lower limit in the integral defining the cost J was taken 
as -r. This served as a notational expediency in the preceding sections, which we 
dispense with in the present section.) 



We will explicitly consider the discretizations of J, 
n-1 T 

J n «E A Q x(s i » 4 S Ns)| a ds, (5.6) 

UO 0 

where {s.J^g is * regular partition of the interval [0,T] with mesh A » s^ ^ - s. « T/n. 

(Although the mesh points of the partition change with n, we will not double subscript 
the s.. This will not lead to any confusion in the sequel.) We will also assume that the 

point delays in the control, r., i»l,...k, correspond to some subset of the {s^}, i * 0,...n. 

Again we let P (•,•) denote the optimal feedback kernel for cost J and let 
n n 

P( ) denote the optimal feedback kernel for cost J. hi the time-invariant case Var 

ItK *)| and Q(*) are constant, so that (4.13) holds for the sequence {P^( *,•)}. The 

particular algorithm which we will be developing is based on approximately P^ ( • ,♦ ) at 

the mesh points {s.h thus it is first necessary to prove that (4.13) actually holds 

everywhere. Before showing this we need some notations and a couple of simple 
observations. 

For any matrix M, as before )M|^ S denotes the Hilbert- Schmidt norm of M (the 

square root of the sum of the squares of its entries), and for specificity we write |M|^ 

for the operator norm of M with respect to the corresponding Euclidean metrics. Note 
that |M| hs S |M| 2 . 

The fundamental matrix solution Y(t,a) to (5.4) (cf (2.7)) has the form Y(t,a) = 
Y(t-a). We set 

Y y « sup |Y(t)| HS , (5.7) 

telO,t] 

and using {12, p. 149] we note that 

|Y(t) - Y(s)| HS - O (|t-s|), t,s c [0,T] . (5.8) 

Similarly F(t,s) (cf (3.6)) is a difference kernel, F(t,s) * F(t-s), and using (5.7) and (5.8) 
we have 

sup IFMIjjs » Y p < «*. (5.9) 

te[0,t] 

and 

|F(t) - F(s)|jjg = O (|t-s|), when t, sc (r, r. + ^) for some i, 0 5 i 5k- 1 (5.10) 

Lemma 5.1. Let P(t,a) denote the optimal feedback kernel for the problem (5.4) - 
(5.5). Then 
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|P(t,a) - P(t\a')| » O (It-t'l 4 |a-a'|). 
Proof. From Theorem 2.5 we have 
T 

P(t,a) * / K(t,s)Y(s-a) ds, 

a 

where 


s 

K(t,s) = F'(s-t) Q + J W_(t,a)F'(s-a)Qdo 
t 

Since sup |K(t,s)| <*> (cf Proposition 3.4 and (5.9)), it follows from (5.8) that 
t,s 

|P(t,a) - P(t,a')| < Kjla-a'l 

for some constant independent of t, a, a’. Thus using (5.7) it remains to show the 
existence of a constant such that 

J |K(t 2 ,s) - K(t r s)| ds S K 2 Itj-tJ. 

Now F(t) has only a finite number of jump discontinuities, so (5.9) and (5.10) imply 
J |P(s-t 2 ) - Ffs-tjllds » 0 (Itj-tJ). 

Hence, it is only necessary to verify that 

X |W_(t 2 ,a) - W_ (t lt a)| da » 0 (Itj-t^) (5.11) 

To this end let W* denote the operator with kernel W (t,o) and let X* « 

(l4W*)” l -I. Denote the kernel of X* by X (t,s). Also let W and X denote the adjoints of 
W* and X* respectively, with respective kernels W + (t,s) and X + (t,s). Now the 
factorization (recall Theorem 2.5) 

I 4 F*F » (NX*) (NX) 

implies 
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(5.12) 


X* » fF # F]_ 4 IF*FW]_, 

ft 

where F F has kernel H(t,s), 

T 

H(t,s) - J F’(a-t) QF (a-s)da. 
max(t.s) 

Note that 

|H(t 2 ,5) - H(t^,s)| = 0 (|X 2 -t 1 l 4 |s 2 - Sl |) (5.13) 

Now (5.12) is equivalent to 
T 

X (t,s) * H(t,s) 4 / H(t,9) W + (e,s)d6, a.e. s,t. 

s 

Because H(t,s) is continuous, X + (t,s), W + (t,s) are also continuous [11], and the equation 
above holds pointwise. Thus (5.13) and the triangle inequality imply 

|X (t 2 ,s) - XJtj.s}* = C (|t 2 -t x \) 

independently of s. Then (5.11) follows from the estimate above and the resolvent 
identity, 

s 

W_(t,s) 4 X_(t,s) 4 / X_ (t,6) W_(0,s)d0 * 0.// 
t 

Proposition 5.2. sup |P a (t,a) - P(t,a)| * 0 (1/n) 
t,a c(0,T] 

Proof. Using the identity 

A [NBA]" 1 - A V * Il4A V *BA V T l A Vt 
for A,B & 0, write the feedback kernel (from Theorem 4.1) as, 

P (t,a) » P' (t) [UU (t)]~ l Y (a) (5.14) 

q 
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(5.15) 


where 


S 

F(t) * A V ‘ lF‘(s -t)Q V * ... F'(T-t)Q 1 ^*]! 


U (t) « J F (a) F (o) da, 


(5.16) 


Y (a) « A 
~n 


i/, 

Q i(s Q -a) 


»/, 

Q x(T-a) 


Note the following bounds independent of n (recall (5.7) and (5.9)): 

‘"PlSn (a,l HS ST|Q ‘ / ‘ l Hs' r Y 


(5.17) 


(5.18) 


l£n (t) >HS S T 'hS Y F 


(5.19) 


sup |(I4U (t)) 1*1 
~n 2 


(5.20) 


Now fix (t , a ) c (0,T] x (0,T] with t £ a . Then there exist indices i, j such 
o o o o 

that (t , a ) c (s. , s.) x (s. , s.J, i £ ). From (5.8) we obtain 

O O 1- 1 I )- 1 j 


l2 n <V - X n s £1 ^ a |y( VV - t, V‘o ,1 hsJ VI iq ' / ' 1 


!£ A |Y(, k -a 0 . a 0 - .,) - Y<V« 0 > l^) V ' |Q V, | 
kit 


0 (1/n). 


(5.21) 


Similarly, expressing F(*) as a sum (as in (3.6)) and using (5.10) we obtain 
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Next observe that 


W 1 !, * KMW l i, |u n <t o ) la^cs,)) -1 ! 


S «W - H ‘V'hS 


(5.23) 


But from (5.16) and (5.19) it follows that 


|U (t ) - U (s.)|„ e » 0 (1/n) (5.24) 

~n o ~n i 'HS 

Putting (5.21) - (5.24) together with the bounds (5.18) - (5.20). and using the 
triangle inequality we get 


IP (t ,d ) - P (s., s.) 1*0 (1/n) 
n o o n l j a 


(5.25) 


Then using the fact that (4.13) holds for a. e. t, a. and that the estimate (5.25) 
holds for any (t , a ) c (s. ,, s.] x (s. ,iS.J, Lemma 5.1 and the triangle inequality 

O O t- 1 1 J-l J 

imply 


|P (t,a) - P(t,a)| = 0 (1/n) 

TL 2 


for all t, a e (0,T] x (0,TJ. H 

Since the idea behind the algorithm defined in Theorems 5.3 and 5.4 is based on a 
relatively simple observation that gets somewhat obscured in the notation and proofs of 
the theorems t it is worthwhile here to briefly remark on this motivating idea. 


If we return to the approximate gain defined in (5.3), it turns out that m the 

A a 

time -invariant case each U(s.) is a principal minor of U(0). Thus what we would like to 



A A 

do Is invert I 4 U(0) via a recursion in which all of the principal minors of I 4 U(0) are 
also inverted. Now by inspection U(0) can be identified with the covariance matrix of 

i/ a 

the random process {A 

Cj » A Z Q V ‘F((i-j)A)o., 

j-o 


where ECcj-Q.') * 6„I. In the signal processing literature (see for example [31]) it is 
shown that processes of this type admit "fast" filter implementations due to their 
"near" Toeplitz covariance matrix structure. This is precisely the property we exploit. 

Theorem 5.3. Define the symmetric N(n4l) xN(n4l) matrix 0 n with NxN block 
entries U?. where (for 0 S j S i 5 n) 

Ujj = £ A 2 Q V *((i-j) A4oA)F , (oA)Q V * 
o=0 


. „ ^ . • D N(n4l) 

Let Z denote the matrix on R 1 

n 


0 •••01 

1 ’ . 0 

b 10 


and for each p - 0,l,...n, let II „ denote the projection 

n,p 

n n,p *N(n4 iP' * ( V" x N(p4l)’ °’ , " 0V 

For p» q * 0, l,...n define (recall A * T/n) 

a . . a 1 N(n 4 l-p) v /A . 

P„ (Ap, Aq) « X Z r Y(Aq) 
n n t n-p n ~n 


where Y is defined in (5.17) end 
~n 


A A m 

x * n * n u n n j 

n»p n,p n,p 


A V, Q 1/t F(0) 

A^Q^FtAp) 

0 

0 
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Then 


max |P (Ap, Aq) - P (Ap,Aq) | = O (1/n) 
n 2 

p S q 

p,q e {0,1,.. .n} 

Proof. Recall the representation (5.14) for the optimal feedback kernel for cost 


F n <t ' a, *£k (t>ll 4 Hn (t, l % (a) - 


Thus we can write 

P (Ap, Aq) * X’(Ap) Y (Aq) 

II Ml 


(5.26) 


X(Ap) = [I+U (Ap)] 1 A Vl Q l/l F(0) 


A V, Q V, F((n-p)A) 


(5.27) 


Note that U (Ap) is an N(m 1) x N(m 1) symmetric matrix with NxN block entires 
U?. (Ap), i,j * 0, l,...n where (for 0 5 j 5 i) 

'"'tt) 


{ (j-P^A 

AQ V * J F((i-j)A40) F’(0)daQ V, f j * p 
0 ® otherwise 


For each p * 0,1,. ..n, let 


*/i */t 

A Q F(0) 


(5.28) 


X = [I 4 II U (0) n J 1 AViQV,F(Ap) 
n,p n,p ~n n,p 

0 


(5.29) 


Observing that 


’ Si.i ,0,i » * °' 1 ""* 
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it follows from (5.27) and (5.29) that 
X (Ato-pl) • Z n te -’ )N X n p 
Therefore (using (5.26)), 

P n (^..A,) - X n n _ p z“ ( " l - p, Y n (A,). 

Now, 

|P b (Ap.A,) - P n (Ap.A<l)| a S T' / '|Q' /l | j Y Y IX nin _ p -X n n _ p | j 

A 

(Here we have used (5.18).) But using (5.29) and the definitions of X and X, 

A 


|X nn -X nn I S T V, |Q V, |Y F |U n -U _<0))I HS . 
n,n-p n,n-p 2 r n na 


And, 


IU" U n <0,| KS * & |U S - Hi? '“Vs 1 * 
i,}*0 


(5.30) 


It follows from (5.26) that 

luS - H^'hs ■ 0 (a2) 

independent of i,j. Thus the right side of (5.30) is 0(A). Hence, 

max |P n (Ap,Aq) - P Q (Ap,Aq)| a « 0 (A), 

p.q 

The result follows from the estimate above and Proposition 5.2.// 

A Q 

Using the extended LWR algorithm 131] we will next show that can 

computed in a total of 0(n*) operations. The theorem above represents the approximate 
gain in the form 


P (Ap,Aq) » X* Z ^ (n4l * p) Y (Aq). 
n n,n-p n 


Now fixing p and letting q vary, these products can be viewed as a convolution. Since 
convolutions have fast implementations in O(nlogn) operations, it will follow then that 
(P(Ap,Aq)}^ j ^ can be computed in 0(n 2 logn) operations. 
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Theorem 5.4. {X F n can be computed in 0(n ) operations. 
n«p p=u 


Proof. By definition X satisfies 

n,p 


A n a 


[i 4 n irn jx „ = n f «» 

‘ n,p n,p n,p n,p~n 


(5.31) 


We will exhibit a fast recursion for solving (5.31). 


A simple calculation shows that 


U? i fJ<|l - U" fj • A J Q 1/, F((i4l)A)F'((j+l)A)Q ,/ * 

Therefore the matrix 6(U n ) with block entries ^ j 4 i ” ^ can b* 

written 


dOJ 11 ) . A 3 


Q V *F(A) 


(F'(A)Q 


V* 


F'(nA)Q V *J 


|j3 V *F(nA)j 

Consequently, rank (6(U**)) 5 N « n. Furthermore (and importantly (31]) 6(11**) has 
a factorization of the form 


6(U n ) « D £ D' 

with D an nNxN matrix and £ an NxN signature matrix. In fact, this factorization is 
easily done by inspection: 


D » A 


»/. 

Q *F(A) 



i/ • 

Q '*F(nA) 


£ * I. 
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Therefore the LWR algorithm (31, p.6551 can he used to recursively compute the 


solutions M_ to 
P 


[i 4 n n u“n im o 

n,p n,p p 


U 


0.P41 


u* 1 

P,P41 


0 

6 


(5.32) 


p » 0, i,...n in a total of 0 (N n ) operations. With these solutions in hand we can argue 
as in the scalar Toeplitz case (see for example 132]) to recursively solve the system 
(5.31). These details are supplied below. 


So now consider solving the problem 


U 4 n u n n ix fl . n f (O), 

n,p n,p n,p n«p ^*n 


(5.33) 


given X . and M . (from (5.32)). 

n,p- 1 p- 1 


Let F denote the N(p4 l)xU matrix composed of the first N(p4 1) rows of £^0), 
end let f denote the NxM matrix composed of the Np4l through N(p4l) rows of F (0). 

p ~u 

Thus we can write 


P+l 


Let S denote the N(p4 1) x N(p4 1) matrix composed of the northwest comer of I 4 U , 
P 

and let r denote the NxN southeast comer of R . Also let M e pN(p4l)xN denote 
P P ~P 

the matrix consisting of the first N(p4l) rows of M , and similarly let N denote the 

Pa P 


N(p4 l)xM matrix consisting of the first N(p4 1) rows of X 


n,p 
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Now define 


n , * 

P+1 

and consider the equation 


— — 




- — 

R p Vi 



s 

F 

P 

“p.l V*1 




V* 


Note that 


U 


o,p+l 


ti“ 

L P.P+1 


N 


P+1' 


From the top of the equations in (S.34) we obtain 


U=R l F-R l * . v * N - M v. 
P P P P+1 P ~P 


Substituting this into the bottom equation, we get 


[r , - u . M ] u ■ f - u ,N 
P+ 1 P+1 ~P P p+1 p 


Noting that 


0 < 


~P 


*NxN 


R u - 

P P+1 


*P+ 1 r p+ 1 


I 

0 


-M 

~P 

*NxN 


(5.34) 


(5.35) 


(5.36) 
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0 


R. 


0 r ,-M u , . 

p+l ~p p+lj 


**NxN = identity matrix), it follows 


V = ^ r p-fl ” U p+1 ~p^ ^p U P +1 P 


N } 


(5.37) 


Thus, given and N , can be computed from (5.36) and (5.37) in 0(N a p) 

A ^ 

-operations (p » N). Hence, in particular C X Q p Jp = Q in (5.31) can be computed in a 
recursive manner in a total of 0(n 2 ) operations./' 


In [33] infinite dimensional Chandrasekhar equations are derived for 
time- invariant hereditary systems without control delay. A fast algorithm based on 
approximating the Chandrasekhar equations is obtained and is shown to possess the 
same convergence properties as reported in [10]. Although there are no direct 
connections between the algorithm we developed here and the one in [33], there are 
some general connections between Chandrasekhar equations and the inversion of near 
Toeplitz systems [31]. 


We note that stability of the algorithms of this section with respect to the data 
Y(t), F(t) is easily demonstrated. For suppose Y(*) and F (•) were replaced by Y £ (•) 
and F £ ( *) with 

sup |Y(t)-Y (t)|, |F(t) - F (t)| < c. 
t c [0,T] 

Using the obvious notation, we obtain the corresponding error estimates, 

» t 


and 


max '^i - ^HS * 
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*11 independent of n. Substituting these errors into the appropriate places in Theorem 
5.3, it follows that an 0(c) perturbation in the data Y(t), F(t) yields an 0(e) perturbation 
in the estimate of the feedback kernel (all of these estimates in the sup norm). 

Another feature to note is that although the implementation defined by Theorem 
5.3 and Theorem 5.4 is recursive backward in time for the computation of the feedback 
kernel (just as one would suspect -e.g. the Riccati equation is solved backward in time), 
in terms of the algorithm’s utilization of the fundamental matrix Y(t), it is actually 
forward in time. Thus any approximating scheme for the computation of Y(t) can be 
readily incorporated into the algorithm. This remark is also true for the 
implementation of Theorem 4.1 introduced in the beginning of this section. 

One final remark concerning the algorithm is that in the event that the point 

delays {r.}^ , do not correspond to a subset of the nodes {s,)f _ the estimate in (5.22) 
i 1 = 1 x x-u, 

is only 0 (l/Vn). Thus the estimate in Theorem 5.3 would also be 0 (1/Vn). Of course 
Theorem 4.1 has the flexibility to place nodes anywhere, and it may be possible to 
recover the stronger convergence by considering algorithms arising from different 
discretization strategies. 

We conclude this section with the following simple scalar example: 

a 

min J(u,x) * / |x(t)| 2 4 |u(t)| 2 dt 
o 

subject to the constraint 

x(t) * x(t) 4 x(t- 1) 4 u(t) 

Since we are seeking the optimal feedback kernel P(t,a), it is not necessary to prescribe 
an initial condition above. 

The algorithm described in the beginning of the section based on the 

A 

approximation (5.3) and the recursive inversion of I 4 U(s^) via the "matrix inversion 


57 


lemma", was programmed using a few lines of Fortran code. Because the fundamental 
solution Y(t) to the differential equation is easily derived on the interval [0,2], the 
exact solution was used in the algorithm. (Recall that an 0(c) error in the 
approximation of Y( • ) results in an 0(c) error in P(t,a).) 

Discretizations with mesh width A * .025. .01. .005. were considered. The 
results for this problem coincided fairly well with the theory. We observed essentially 
linear (uniform) convergence of the feedback kernels as predicted. Tables 5.1 - 5.3 
contain these results. Table 5.4 contains values of the kernel obtained via the Riccati 
equation approach using linear spline approximations of the history space. The gain 
computed using these approximations appeared to have converged to two (and in some 
instances three) significant figures. The author is indebted to Professor J. S. Gibson for 
providing these values for comparison. 


Table 5.1 
A • .025 

C 


Kt,*> 

0.0 

0.3 

t*0.0 

2.7881 

2.4237 

t+O.I 

2.3699 

2.0632 

t+0.2 

2.0210 

1.7566 

t+0,3 

1.7237 

1.4972 

t+0.4 

1.4712 

1.2790 

t-KJ.3 

1.2376 

1.0972 

t+0.6 

1.0780 

0.9411 

t+0.7 

0.9280 

0.8037 

t+O.8 

0.8040 

0.6822 

t+O.9 

0.7029 

0.5472 

t+1.0 

0.6221 

0.4775 


1.0 1.5 


1.7011 

0.7462 

1.4473 

0.5799 

1.2222 

0.4252 

1.0213 

0.2789 

0.8406 

0.1381 

0.6765 

0.0000 

0.5258 

0.0000 

0.3855 

0.0000 

0.2528 

0.0000 

0.1252 

0.0000 

0.0000 

0.0000 


Table 5.2 
d * .01 
t 


m.fc) 

0.0 

0.5 

t+0.0 

2.7300 

2.3951 

t+fl. 1 

2.3279 

2.0378 

t+0.2 

1.9844 

1.7341 

t+0.3 

1.6919 

1.4770 

t+o.4 

1.4335 

1.2609 

t+0.3 

1.2335 

1.0808 

, t+0.6 

1.0570 

0.9270 

t+0.7 

0.9096 

0.7916 

t+0.8 

0.7878 

0.6721 

t+0.9 

0.6886 

0.5659 

t+1.0 

0.6093 

0.4710 


1.0 

1.3 

1.6943 

0.7521 

1.4110 

0.SS4S 

1.2165 

0.4285 

1.0162 

0.2810 

0.8361 

0.1391 

0.6727 

0.0000 

9.5228 

0.0000 

0.3832 

0.0000 

0.2513 

0.0000 

0.1244 

0.0000 

0.0000 

0.0000 
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Table 5.3 
A - .005 

t 


a 


T(t»a) 


o.o 


*40.0 

*40.1 

*40.2 

*40.3 

*40.4 

*40.5 

*40.6 


2.7140 

2.3140 

1.9724 

1.6814 

1.4344 

1.2556 

1.0500 


*40.7 

*40.8 

*40.9 

t4l.O 


0.9036 

0.7825 

0.6839 

0.6051 


0.5 1.0 


2.3855 

1.6919 

2.0293 

1.4388 

1.7265 

1.2145 

1.4703 

1.0144 

1.2549 

0.8346 

1.0753 

0.6714 

0.9223 

0.5217 

0.7876 

0.3824 

0.6887 

0.2508 

0.5631 

0.1241 

0.4688 

0.0000 


1.5 


0,7541 

0.5859 

0,4295 

0.2817 

0.1394 

0.0000 

0.0000 

0.0000 

0,0000 

0.0000 

0.0000 


Table 5.4 

Spline Approximation 


?(*.•) 

0.0 

0.5 

t 

1.0 

1.5 

t40.0 

2.6915 

2.3742 

1.6860 

0.7491 

t40.l 

2.2986 

2.0177 

1.4355 

0.5894 

t40.2 

1.9639 

1.7215 

1.2165 

0.4366 

t40.3 

1.6749 

1.4685 

1.0150 

0.2776 

t40.4 

1.4243 

1.2489 

0.8284 

0.1257 

t40.5 

1.2114 

1.0605 

0.6658 

0.0217 

*40.6 

1.0445 

0.9234 

0.5230 

0.0005 

*40.7 

0.8994 

0.7862 

0.3881 

0.0032 

*40.8 

0.7725 

0.6620 

0.2404 

0.0005 

*40.9 

0.6786 

0.5567 

0.1079 

0.0004 

*41.0 

0.6013 

0.4690 

0.0323 

0.0000 


v- 
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CONCLUDING REMARKS . Our focus has been on the control problen for tbe REDE on 
• finite interval. A natural question which arises is whether tbe approach 
can be adapted to treat tbe infinite tine problen. Several aspects of tbe 
analysis can probably be extended to this case. (Sone of these extensions 
appear nontrivial bovever). For example, although the factorization results 
and projection integrals discussed in Section 2 are based on Eilbert-Schmidt 
assumptions (which are not valid for problens on the sen i- infinite interval)# 
for stable tine-invariant systems the infinite-tine factorization counterpart 
to (2.10 is the classical Viener-Hopf factorization. Thus we expect that the 
optimal control lav can at least be formulated in the same manner as in 
Theorem 2.5 for the infinite-tine problen. Any approximation scheme devised 
thereafter would necessarily have to consider approximating solutions to a 
Tiener-Hopf equation. In spirit such an approach night nahe contact with some 
vorh of Davis [4]. 

Fundamental to the analysis of this report was the exploitation of two 
properties of the factorization approach of [24], First, that the entire 
senigroup never needs approximation, and second that connections between the 
open loop system and feedback kernel are fairly transparent via the 
factorization. These properties are not unique to the RFDE control problen. 
and as a final consent we remark that the general approach should lesd to 
approximate feedback lavs in other settings as well. 
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FOOTNOTES 


1. Hie kernel K(t.s) of en integral operator I is seniseptrable if there exist 
aistrix functions Hj[(t) end Gi(a)» i “ I»2. snch thst K(t.s) “ Hi(t)Gi(*) 
for s < t, and E(t.s) ■ HjftlGjCt) for » i t. The kernel, is separable if 
we can choose “ Hj and Gj ■ Gj* la this case the associated operator E 
has finite rank. 

2. It is nnfortnnate that although this convergence is obtained using a 
first-order quadrature scheae. it is aot a priori evident that eaploying a 
higher order scheae would result in iaproved convergence. The stuabling 
block is that the convergence analysis ve have used is based on properties 
of the fundaaental matrix, vhich is generally only absolutely continuous, 
and thus precludes any straightforward extensions. 


Y 
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